An optimal speech enhancement under speech uncertainty probability and masking property of auditory system
نویسندگان
چکیده
Recently, I.Cohen has presented causal and noncausal algorithms to modify the classic decision-directed approach for prior SNR. It is well-known that prior SNR is critical to trade off the musical noise level and the audible clearness level in spectral subtraction speech enhancement. However, all these algorithms conflict with statistical signal model more or less. To adjust smoothing parameters which play an important role on the recursive procedure of prior SNR and noise spectrum estimate more reasonably, we present novel speech uncertainty state model which capitalizes on the masking property of auditory system, and propose a new modified approach which employs speech uncertainty probability to make automatic adaptation of smoothing parameters. Novel algorithm is capable of eliminating musical noise meanwhile lowering speech distortion by remaining original speech in the case of inaudible noise under masking threshold. Experiments confirm that novel algorithm is superior to classic methods, particularly at low SNR environment.
منابع مشابه
Speech Enhancement using Laplacian Mixture Model under Signal Presence Uncertainty
In this paper an estimator for speech enhancement based on Laplacian Mixture Model has been proposed. The proposed method, estimates the complex DFT coefficients of clean speech from noisy speech using the MMSE estimator, when the clean speech DFT coefficients are supposed mixture of Laplacians and the DFT coefficients of noise are assumed zero-mean Gaussian distribution. Furthermore, the MMS...
متن کاملA Generalized Time–Frequency Subtraction Method for Robust Speech Enhancement Based on Wavelet Filter Banks Modeling of Human Auditory System
We present a new speech enhancement scheme for a single-microphone system to meet the demand for quality noise reduction algorithms capable of operating at a very low signal-tonoise ratio. A psychoacoustic model is incorporated into the generalized perceptual wavelet denoising method to reduce the residual noise and improve the intelligibility of speech. The proposed method is a generalized tim...
متن کاملSpeech enhancement using a wiener denoising technique and musical noise reduction
Speech enhancement methods using spectral subtraction have the drawback of generating an annoying residual noise with musical character. In this paper a frequency domain optimal linear estimator with perceptual post filtering is proposed which incorporates the masking properties of the human auditory system to make the residual noise distortion inaudible. The performance of the proposed enhance...
متن کاملPerceptual Speech Enhancement Using a Hilbert Transform Based Time-Frequency Representation of Speech
A new Time-Frequency (TF) representation of speech signal is introduced and used for speech enhancement. TF representation and speech enhancement algorithm are both based on perceptual properties of human auditory system in which the concept of band analysis is exploited. TF representation is carried out by the means of analytic decomposition of speech signal in the hearing Critical Bands (CB) ...
متن کاملSpeech Enhancement using Temporal Masking and Fractional Bark Gammatone Filters
A speech enhancement technique based on the temporal masking properties of the human auditory system is presented. The noisy signal is divided into a number of sub-bands with fractional bark accuracy, and the sub-band signals are individually and adaptively weighted in the time domain according to a short-term temporal masking threshold to noise ratio estimate in each subband. Objective measure...
متن کامل